CDS
Accession Number | TCMCG075C00843 |
gbkey | CDS |
Protein Id | XP_007047556.2 |
Location | complement(join(3611869..3612019,3612351..3612423,3612867..3613026,3613310..3613421,3614021..3614118,3614234..3614657,3615317..3615468,3615718..3615926,3616227..3616314,3616396..3616752)) |
Gene | LOC18611300 |
GeneID | 18611300 |
Organism | Theobroma cacao |
Protein
Length | 607aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007047494.2 |
Definition | PREDICTED: probable amino-acid acetyltransferase NAGS1, chloroplastic [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | E |
Description | Amino-acid acetyltransferase |
KEGG_TC | - |
KEGG_Module |
M00028
[VIEW IN KEGG] |
KEGG_Reaction |
R00259
[VIEW IN KEGG] |
KEGG_rclass |
RC00004
[VIEW IN KEGG] RC00064 [VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko00002 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K14682
[VIEW IN KEGG] |
EC |
2.3.1.1
[VIEW IN KEGG]
[VIEW IN INGREDIENT] |
KEGG_Pathway |
ko00220
[VIEW IN KEGG] ko01100 [VIEW IN KEGG] ko01110 [VIEW IN KEGG] ko01130 [VIEW IN KEGG] ko01210 [VIEW IN KEGG] ko01230 [VIEW IN KEGG] map00220 [VIEW IN KEGG] map01100 [VIEW IN KEGG] map01110 [VIEW IN KEGG] map01130 [VIEW IN KEGG] map01210 [VIEW IN KEGG] map01230 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGATGGCCGCTTCATATTCAACGGCTCGCGTCCCTCTCTTCTCTCCCGCTCGAACCAAACTCTTATCATCCCGCCACGGCTTCAAAAAGGGCGTCGTAAAACTAAAACCCGACTTGAAGTGCCGGGCTCAGTCTCTTAAACCGGAACCCGGGTCCAAGCGAGGTGATTCTGTCAAGCGCAATGTGATTAACGATGAAGATAGCGTGGAGGAGACTTACAACACCGTCGACGATAAGCAGTTCGTGCGGTGGTTCCGCGAGGCTTGGCCTTACCTCTGGGCCCATCGCGGCAGCACTTTCGTTGTTATTATTTCCGGCGAAATCGTCGCTTCCCCCTCTTTGGACGCCATTTTAAAGGATATTGCGTTTTTGCATCACCTAGGAATCAGATTTGTTATTGTTCCAGGAACTCACGTGCAGATCGACAAGCTTTTGGCCGAGAGAGACCATGAACCAAAGTATGTAGGCAGATATAGAATTACAGACTCAGAATCTCTAGCTGCAGCAATGGAAGCAGCAGGAGGGATTCGTCTAATGATAGAGGCAAAACTTTCTCCTGGACCTTCCATATGTAATATCCGTCGACATGGTGATAGTAGCCGTTGGCATGAAGTTGGTGTCAGTGTTGCTAGTGGAAACTTCCTTGCAGCTAAGAAAAGAGGAGTTGTTGAAGGTGTTGATTATGGAGCAACAGGTGAAGTAAAGAAGGTAGATGTTGCTCGCATGCGTGAGAGGCTTGACGGTGGTTGTATAGTAATATTAAGCAACCTGGGGTATTCTAGCTCTGGAGAAGTTTTGAATTGCAACACATATGAAGTTGCTACTGCTTGTGCATTAGCTATTGGAGCAGATAAGCTGATTTGCATTATAGATGGTCCAATTTTGGATGAGAATGGACGCCTTATTAATTTCTTGCCTCTTCAAGAAGCAGATATGTTAATCCGTCAACGGGCTAAGCAAAGCGAGACAGCAGCTAAATATGTGAAAGCTGTTGATGAAGAAGATGTCACTTGCCTTGGACATTATGATTCTATTGCAGTTGTCCCCTCTTCACAGAATGGGAAGGTTCTTAATAGTACACACAATCCAACCTTTCAGAATGGTGTTGGTTTTGATAATGGCAATGGACTATGGTCTGGAGAGCAGGGCTTTGCTATTGGAGGTCAGGAGCGGCTAAGTCGACTAAATGGCTACCTTTCAGAGTTGGCTGCTGCCGCTTTTGTCTGCAGAGGTGGTGTCCAAAGAGTTCATTTGTTAGATGGCACTATTGGTGGGGTCTTATTATTGGAACTGTTCAAAAGAGATGGAATGGGGACAATGGTGGCCAGTGATCTATATGAAGGTACCCGGATGGCGAAGGTGATGGATCTCTTAGGTATCAAGCAAATCATACAACCTTTAGAAGAGTCTGGCACATTGGTTTGCAGGAGTGATGAGGAGCTACGTAAGGCCATAGATTCATTTGTTGTTATGGAAAGGGAAGGTCAAATCGTTGCTTGTGCTGCTCTTTTTCCTTTTTTCAAGGACAAGTGTGGGGAAGTTGCTTGTATTGCAGTTTCTCCTGAATGCCGAGGACAAGGACAGGGAGACAAATTACTTGATTACGTAGAGAAGAAGGCATCATCCCTTGGATTGGATATGCTTTTCCTGCTGACAACCCGTACTGCTGATTGGTTTGTTAGGCGCGGCTTCGAAGAATGTACCATTGACATGATACCAGATGAAAGGAGGAAAAAGATCAATCTATCCCGTAAATCCAAGTATTACATGAAGAAGTTGCTACCGGATCGAAGTGGAATTACTGCTGATAGAGCATTTAAATGA |
Protein: MMAASYSTARVPLFSPARTKLLSSRHGFKKGVVKLKPDLKCRAQSLKPEPGSKRGDSVKRNVINDEDSVEETYNTVDDKQFVRWFREAWPYLWAHRGSTFVVIISGEIVASPSLDAILKDIAFLHHLGIRFVIVPGTHVQIDKLLAERDHEPKYVGRYRITDSESLAAAMEAAGGIRLMIEAKLSPGPSICNIRRHGDSSRWHEVGVSVASGNFLAAKKRGVVEGVDYGATGEVKKVDVARMRERLDGGCIVILSNLGYSSSGEVLNCNTYEVATACALAIGADKLICIIDGPILDENGRLINFLPLQEADMLIRQRAKQSETAAKYVKAVDEEDVTCLGHYDSIAVVPSSQNGKVLNSTHNPTFQNGVGFDNGNGLWSGEQGFAIGGQERLSRLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRMAKVMDLLGIKQIIQPLEESGTLVCRSDEELRKAIDSFVVMEREGQIVACAALFPFFKDKCGEVACIAVSPECRGQGQGDKLLDYVEKKASSLGLDMLFLLTTRTADWFVRRGFEECTIDMIPDERRKKINLSRKSKYYMKKLLPDRSGITADRAFK |